# README – BJP and INC Twitter Dataset (2024 Indian General Elections)

## Overview

This dataset contains tweets posted by the official accounts of the Bharatiya Janata Party (BJP) and the Indian National Congress (INC) during the final phase of the 2024 Indian general election campaign. It is intended to support analysis of political communication strategies on X (formerly Twitter), with a focus on how the ruling party (BJP) and the opposition party (INC) make use of the platform's affordances.

## Contents

- `Cleaned_BJP_Tweets.xlsx`  
  Contains 1666 tweets from the official BJP account.

- `Cleaned_INC_Tweets.xlsx`  
  Contains 1658 tweets from the official INC account.

- `Codebook.docx`  
  Provides definitions and coding instructions for all variables used in the dataset.

## Data Collection

- Source: X (Twitter)
- Accounts:
  - BJP: `@BJP4India`
  - INC: `@INCIndia`
- Collection Period: May 1, 2024 – May 30, 2024
- Collection Method: Manual (no automated scraping tool was used)
- Language: English. Tweets originally posted in Hindi were translated into English using Google Translate, and the translations were reviewed by the researcher.

## Variables (Full details in Codebook)

Each row in the dataset represents a single tweet. The variables include:

- `Date`: Timestamp of tweet
- `Text`: Full tweet content
- `Hashtags`: All hashtags used
- `Mentions`: All user mentions
- `Language`: Language of the tweet text as stored in the dataset (English)
- `Retweets`: Count at time of collection
- `Likes`: Count at time of collection
- `Sentiment`: Positive, Negative, or Neutral
- `Main_Topic`: Primary topic of the tweet (e.g., Leadership, Development, Religion, Criticism)
- `Consent_Code`: One of 20 techniques adapted from the Propaganda Model (e.g., EmotionalAppeal, AuthorityTransfer, EliteQuote, OppositionAttack)
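
As a rough illustration of how the variables above might be checked after loading, here is a minimal Python sketch using only the standard library. It assumes each tweet has already been read into a dictionary keyed by the column names listed in this README (for example via a spreadsheet library of your choice). The function names `validate_row` and `tally` are hypothetical helpers, not part of the dataset, and the two sample rows are made-up placeholders, not real tweets. The Codebook remains the authoritative reference for column names and allowed values.

```python
from collections import Counter
from numbers import Integral

# Column names as listed in this README (assumed to match the spreadsheet headers).
EXPECTED_COLUMNS = [
    "Date", "Text", "Hashtags", "Mentions", "Language",
    "Retweets", "Likes", "Sentiment", "Main_Topic", "Consent_Code",
]
SENTIMENTS = {"Positive", "Negative", "Neutral"}


def validate_row(row):
    """Return a list of problems found in one tweet row (empty if none)."""
    problems = ["missing column: " + c for c in EXPECTED_COLUMNS if c not in row]
    if row.get("Sentiment") not in SENTIMENTS:
        problems.append("unexpected Sentiment: %r" % (row.get("Sentiment"),))
    for col in ("Retweets", "Likes"):
        value = row.get(col)
        # Counts are assumed to be non-negative whole numbers.
        if not isinstance(value, Integral) or value < 0:
            problems.append("invalid %s: %r" % (col, value))
    return problems


def tally(rows, column):
    """Count how often each value of `column` occurs across the rows."""
    return Counter(row[column] for row in rows)


if __name__ == "__main__":
    # Made-up placeholder rows for illustration only; not taken from the dataset.
    sample_rows = [
        {"Date": "2024-05-01", "Text": "placeholder text", "Hashtags": "",
         "Mentions": "", "Language": "English", "Retweets": 3, "Likes": 10,
         "Sentiment": "Positive", "Main_Topic": "Leadership",
         "Consent_Code": "EmotionalAppeal"},
        {"Date": "2024-05-02", "Text": "placeholder text", "Hashtags": "",
         "Mentions": "", "Language": "English", "Retweets": 1, "Likes": 4,
         "Sentiment": "Negative", "Main_Topic": "Criticism",
         "Consent_Code": "OppositionAttack"},
    ]
    for row in sample_rows:
        print(validate_row(row))
    print(tally(sample_rows, "Sentiment"))
```

The same `tally` helper can be pointed at `Main_Topic` or `Consent_Code` to compare how often each category appears in the BJP and INC files.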

## Citation

If you use this dataset, please cite:

> [Your Name]. (2024). BJP and INC Twitter Campaign Dataset – 2024 Indian General Election. Zenodo. https://doi.org/XXXXXXX

## Contact

For questions or clarifications, please contact:
- Rachna
- Rachna.2022@vitstudent.ac.in